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AN APPROACH TO COMPUTING DOWNWARD CLOSURES 


GEORG ZETZSCHE 


Abstract. The downward closure of a word language is the set of all (not 
necessarily contiguous) subwords of its members. It is well-known that the 
downward closure of any language is regular. While the downward closure 
appears to be a powerful abstraction, algorithms for computing a finite au¬ 
tomaton for the downward closure of a given language have been established 
only for few language classes. 

This work presents a simple general method for computing downward clo¬ 
sures. For language classes that are closed under rational transductions, it is 
shown that the computation of downward closures can be reduced to checking 
a certain unboundedness property. 

This result is used to prove that downward closures are computable for 
(i) every language class with effectively semilinear Parikh images that are 
closed under rational transductions, (ii) matrix languages, and (iii) indexed 
languages (equivalently, languages accepted by higher-order pushdown au¬ 
tomata of order 2). 


1. Introduction 

The downward closure L\. of a word language L is the set of all (not necessarily 
contiguous) subwords of its members. While it is well-known that the downward 
closure of any language is regular [l^l, it is not possible in general to compute 
them. However, if they are computable, downward closures are a powerful ab¬ 
straction. Suppose L describes the behavior of a system that is observed through 
a lossy channel, meaning that on the way to the observer, arbitrary actions can 
get lost. Then, L\, is the set of words received by the observer mr Hence, given 
the downward closure as a finite automaton, we can decide whether two systems 
are equivalent under such observations, and even whether one system includes the 
behavior of another. 

Further motivation for studying downward closures stems from a recent result 
of Czerwinski and Martens §|. It implies that for language classes that are closed 
under rational transductions and have computable downward closures, separability 
by piecewise testable languages is decidable. 

As an abstraction, compared to the Parikh image (which counts the number of 
occurrences of each letter), downward closures have the advantage of guaranteeing 
regularity for any language. Most applications of Parikh images, in contrast, re¬ 
quire semilinearity, which fails for many interesting language classes. An example 
of a class that lacks semilinearity of Parikh images and thus spurred interest in 
computing downward closures is that of the indexed languages 3] or, equivalently, 
those accepted by higher-order pushdown automata of order 2 [271] . 

It appears to be difficult to compute downward closures and there are few lan¬ 
guage classes for which computability has been established. Computability is known 
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OL-systems and context- 
and stacked counter au- 


for context-free languages and algebraic extensions [ 3 , 2f 
free FIFO rewriting systems [l], Petri net languages 11 
tornat a 1341 . They are not computable for reachability sets of lossy channel sys¬ 
tems [28( and Church-Rosser languages 14]. 

This work presents a new general method for the computation of downward 
closures. It relies on a fairly simple idea and reduces the computation to the 
so-called simultaneous unboundedness problem (SUP). The latter asks, given a lan¬ 
guage L C a* ■ • • a* , whether for each k £ N, there is a word a^ 1 ■ ■ ■ afj 1 £ L such 
that Xi,... ,x n > k. This method yields new, sometimes greatly simplified, algo¬ 
rithms for each of the computability results above. It also opens up a range of other 
language classes to the computation of downward closures. 

First, it implies computability for every language class that is closed under ratio¬ 
nal transductions and exhibits effectively semilinear Parikh images. This re-proves 


computability for context-free languages and stacked counter automata [34|, but 


also applies to many other classes, such as the multiple context-free languages [321] . 
Second, the method yields the computability for matrix grammars [9, lOj], a power¬ 
ful grammar model that generalizes Petri net and context-free languages. Third, it 
is applied to obtain computability of downward closures for the indexed languages. 


2. Basic notions and results 

If X is an alphabet, X* (X + ) denotes the set of (non-empty) words over X. The 
empty word is denoted by e £ X*. For a symbol x £ X and a word w £ X*, let 
|w;| x be the number of occurrences of x in w. If w £ X *, we denote by alph(w) the 
set of symbols occurring in w. For words u, v £ A'*, we write u A v if u = u± ■ ■ ■ u n 
and v = vguivi ■ ■ ■ u n v n for some Ui,..., u n , vg,..., v n £ X*. It is well-known that 
A is a well-quasi-order on X* and that therefore the downward closure 

Lf={u£ X* | 3v £ L : u A v} 

is regular for any L C X* [16]. If X is an alphabet, X® denotes the set of maps 
a: X —>• N, which are called multisets. For a,/3 £ A®, k £ N the multisets a + (3 
and k ■ a are defined in the obvious way. A subset of X® of the form 

{po + xi ■ hi 4-b x n ■ p n | a?i,..., x n > 0} 

for po,...,p n £ X® is called linear and pi,...,p n are its period elements. A 
finite union of linear sets is said to be semilinear. The Parikh map is the map 
T: X* —> X® defined by ^ r (u>)(x) = |w| x for all w £ X* and x £ X. We lift T to 
sets in the usual way: 'F(L) = {T(u) | w £ L}. If T(L) = ^(X), then L and K 
are said to be Parikh-equivalent. 

A finite automaton is a tuple (Q, X, E, qo, F), where Q is a finite set of states, 
X is its input alphabet, £CQxX*xQisa finite set of edges, qo £ Q is its initial 
state, and F C Q is the set of its final states. If there is a path labeled w £ X* 
from state p to q, we denote this fact by p q. The language accepted by A is 
denoted L(A). 

A (finite-state) transducer is a tuple (Q, X,Y, E, qo, F), where Q, X, qo, F are 
defined as for automata and Y is its output alphabet and E C Q x X* x Y* x Q is the 
finite set of its edges. If there is a path from state p to q that reads the input word 
u £ X* and outputs the word v £ Y*, we denote this fact by p u ' v y q. In slight 
abuse of terminology, we sometimes specify transducers where an edge outputs a 
regular language instead of a word. 
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For alphabets X,Y, a transduction is a subset of A'* x Y*. If A is a transducer 
as above, then T (A) denotes its generated transduction , namely the set of all pairs 
(u, v ) G X* x Y* such that qo u,v } / for some / G F. Transductions of the form 
T(A) are called rational. For a transduction T C X* x Y* and a language L C A*, 
we write Tf = {v G F* \3u € L: (u, v) G T}. A class of languages C is called a full 
trio if it is effectively closed under rational transductions, i.e. if TL G C for each 
IgC and each rational transduction T. 

Observe that for each full trio C and L G C, the language L j, is effectively con¬ 
tained in C. By computing downward closures we mean finding a finite automaton 
for Lf when given a representation of L in C. It will always be clear from the 
definition of C how to represent languages in C. 

The simultaneous unboundedness problem. We come to the central decision 
problem in this work. Let C be a language class. The simultaneous unboundedness 
problem (SUP) for C is the following decision problem: 

Given: A language L C a* • • • a* in C for some alphabet {ai,..., a n }. 

Question: Does LI equal a* ■ ■ ■ a* ? 

The term “simultaneous unboundedness problem” reflects the fact that the equality 
Lf = a* • • • a* holds if and only if for each k G N, there is a word a* 1 ■ ■ ■ a* n G L 
such that Xi ,..., x n > k. 

After obtaining the results of this work, the author learned that Czerwinski and 
Martens considered a very similar decision problem [s] . Their diagonal problem 
asks, given a language L C X* whether for each k G N, there is a word w G L 
with |w|a, > k for each x G X. Czerwinski and Martens prove that for full trios 
with a decidable diagonal problem, it is decidable whether two given languages 
are separable by a piecewise testable language. In fact, their proof only requires 
decidability of the (ostensibly easier) SUP. Here, Theorem [j] implies that in each 
full trio, the diagonal problem is decidable if and only if the SUP is. 

The following is the first main result of this work. 

Theorem 1. Let C be a full trio. Then downward closures are computable for C if 
and only if the SUP is decidable for C. 

The proof of Theorem [0 uses the concept of simple regular expressions. Let A' 
be an alphabet. An atomic expression is a rational expression of the form (x U e) 
with x G X or of the form (Xi U • • • U x n )* with aq,..., x n G X. A product is a 
(possibly empty) concatenation a\- ■ ■ a n of atomic expressions. A simple regular 
expression (SRE) is of the form pi U • • • U p n , where the pi are products. Given an 
SRE r, we write L(r) for the language it describes. 

Theorem [H employs the following result of Jullien [HI (which was later rediscov¬ 
ered by Abdulla, Collomb-Annichini, Bouajjani, and Jonsson !)• 

Theorem 2 (Jullien [Hj]). Simple regular expressions describe precisely the down¬ 
ward closed languages. 

We are now ready to prove Theorem [j]. 

Proof of Theorem Q. Of course, if downward closures are computable for C, then 
given a language L C a\ ■ ■ ■ a* in C, we can compute a finite automaton for L\, and 
check whether L\, = ■ ■ ■ a*. This proves the “only if” direction. 
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For the other direction, we first observe that the emptiness problem can be 
reduced to the SUP. Indeed, if LCI* and T is the rational transduction X* x {a}*, 
then TL C a* and (TL)i = a* if and only if L ^ 0. 

Now, suppose the SUP is decidable for C and let L C X*. Since we know that 
L-l is described by some SRE, we can enumerate SREs over X and are guaranteed 
that one of them will describe L\.. Hence, it suffices to show that given an SRE r, 
it is decidable whether L(r) = L\.. 

Since L(r) is a regular language, we can decide whether Li C L(r) by checking 
whether L\. (~l (X* \ L(r)) = 0. This can be done because we can compute a 
representation for L\. n (X* \ L(r)) in C and check it for emptiness. It remains 
to be shown that it is decidable whether L(r) C L\,. 

The set L(r) is a finite union of sets of the form {tro}4TT{rui}i • • • Y* {w n }4- for 
some Yi C X, Yi yf 0, 1 < i < n, and Wi £ X *, 0 < i < n. Therefore, it suffices 
to decide whether {wojiFi*{wi}| • • • Y* {«; n }4. C L\.. Since L\. is downward closed, 
this is equivalent to 

(1) woY* Wl • • • Y*w n C Li. 

For each i £ {1,..., n}, we define the word Wj = yi • • • Uk, where Y = {y \...., y^}. 
Observe that wqY^w\ ■ ■ - Y*w n C Li holds if and only if for every k > 0, there are 
numbers xi ,..., x n > k such that wqu^wi ■ ■ ■ u* n w n £ Li. Moreover, if T is the 
rational transduction 

T = {(wou^wi ■ ■■<"»„, ai 1 ■ ■ ■a* n ) \ xi,...,x n > 0}, 

then T(Li) = {a* 1 ■ ■ ■ a^ n | wqu^wi ■ ■ ■ u^ n w n £ Li}. Thus, eq. ([l|) is equivalent 
to (T(L|))4, = a} ■ ■ ■ a*, which is an instance of the SUP, since we can compute a 
representation of T(Li) in C. □ 

Despite its simplicity, Theorem [l] has far-reaching consequences for the com¬ 
putability of downward closures. Let us record a few of them. 

Corollary 3. Suppose C and V are full trios such that given L £ C, we can compute 
a Parikh-equivalent K £ T>. If downward closures are computable for T>, then they 
are computable for C. 

Proof. We show that the SUP is decidable for C. Given L £ C, L C a\ ■ ■ - a*, 
we construct a Parikh-equivalent K £ V. Observe that then 'I'(Ah).) = 

We compute a finite automaton A for Kf and then a semilinear representation of 
\I/(L(M)) = ^(A'l) = Then Li = a} ■ ■ ■ a* if and only if some of the linear 

sets has for each 1 < i < n a period element containing aj. Hence, the SUP is 
decidable for C. □ 


Note that if a language class has effectively semilinear Parikh images, then we 
can construct Parikh-equivalent regular languages. Therefore, the following is a 
special case of Corollary 0. 


Corollary 4. For each full trio with effectively semilinear Parikh images, downward 
closures are computable. 


Corollary 0, in turn, provides computability of downward closures for a variety 
of language classes. First^ it re-proves the classical downward closure result for 
context-free languages 0, 251 and thus algebraic extensions [2f| (see 34] for a simple 
reduction of the latter to the former). Second, it yields a drastically simplified proof 
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of the computability of downward closures for stacked counter automata , which was 
shown in 3-1] using the machinery of Parikh annotations. It should be noted, 
however, that the algorithm in [34| is easily seen to be primitive recursive, while 
this is not clear for the brute-force approach presented here. 

Corollary^ also implies computability of downward closures for multiple context- 
free languages (32||, which have received considerable attention in computational 
linguistics. As shown in [32j, the multiple context-free languages constitute a full 
trio and exhibit effectively semilinear Parikh images. 

Our next application of Theorem JT] is an alternative proof of the computability 
of downward closures for Petri net languages, which was established by Habermehl, 
Meyer, and Wimmcl 0- Here, by Petri net language, we mean sequences of tran¬ 
sition labels of runs from an initial to a final marking. Czerwinski and Martens @ 
exhibit a simple reduction of the diagonal problem for Petri net languages to the 
place boundedness problem for Petri nets with one inhibitor arc, which was proven 
decidable by Bonnet, Finkel, Leroux, and Zeitoun Q. Since the Petri net languages 
are well-known to be a full trio [21 ] , Theorem [l] yields an alternative algorithm for 
downward closures of Petri net languages. 

We can also use Corollary 0 to extend the computability of downward closures 
for Petri net languages to a larger class. Matrix grammars are a powerful formalism 
that is well-known in the area of regulated rewriting and generalizes numerous other 
grammar models 1,0- They generate the matrix languages, a class which strictly 
includes both the context-free languages and the Petri net languages. It is well- 
known that the matrix languages are a full trio and given a matrix grammar, one 
can construct a Parikh-equivalent Petri net language 0. Thus, the following is a 
consequence of Corollary yj. 


Corollary 5. Downward closures are computable for matrix languages. 

Finally, we apply Theorem [l] to the indexed languages. These were introduced 
by Aho [3(] and are precisely those accepted by higher-order pushdown automata 
of order 2 l 27]. Since indexed languages do not have semilinear Parikh images, 
downward closures are a promising alternative abstraction. 


Theorem 6. Downward closures are computable for indexed languages. 

The indexed languages constitute a full trio [3f, and hence the remainder of this 
work is devoted to showing that their SUP is decidable. Note that since this class 
significantly extends the OL-languages 0, Theorem^ generalizes the computability 
result of Abdulla, Boasson, and Bouajjani for OL-systems and context-free FIFO 
rewriting systems [lj. 

Theorem 0 has an interesting consequence for computability of downward clo¬ 
sures in general. We will observe the following. 

Proposition 7. Given an indexed language L C a*b*, it is undecidable whether 
there is an n £ N with a n b n £ L. 


First, this demonstrates that a slight variation of the SUP is already undecidable. 
More importantly, Proposition 0 means that in automata that have access to a 
higher-order pushdown of order 2 and a very simple type of counter, reachability 
is undecidable: Given a second-order pushdown automaton for L, one can use an 
additional counter to accept L fl {a n b n \ n > 0}. Here, it even suffices to use a 
blind counter (that is, one that can assume negative values and has to be zero in 
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accepting configurations [13J) or a reversal-bounded counter {2£j (that is, one that 
switches between incrementing and decrementing only a bounded number of times). 

This is in contrast to the frequently used fact that semilinearity is preserved by 
adding blind (or reversal bounded) counters: When an automata model guarantees 
effectively semilinear Parikh images, then the model obtained by adding blind coun¬ 
ters or reversal-bounded counters still enjoys this property. Of course, this is not a 
precise statement^ but this fact has been discovered for various notions of storage 
mechanisms [T? ^ 26|, 35|. Note that blind counters and reversal-bounded counters 
are equivalent 131 (see [22] for a translation that is economic in the number of 
counters). Theorem @ and Proposition 0 together imply that this preservation has 
no analog for downward closures: 

Adding blind (or reversal bounded) counters does not preserve com¬ 
putability of downward closures. 


3. Indexed languages 

Let us define indexed grammars. The following definition is a slight variatiorQ 
of the one from [lflj |. An indexed grammar is a tuple G = (N, T, /, P, S ), where N, 
T, and / are pairwise disjoint alphabets, called the nonterminals , terminals , and 
index symbols , respectively. P is the finite set of productions of the forms A —> w, 
A —>- Bf , Af —»■ w , where A,B £ N, f £ I, and w £ (N U T)*. We regard a word 
Afi ■ ■ ■ f n with /i, S / as a nonterminal to which a stack is attached. Here, 

/i is the topmost symbol and f n is on the bottom. For w £ (N U T)* and x £ I*, 
we denote by [w, x ] the word obtained by replacing each A £ N in w by Ax. A 
word in ( NI * U T)* is called a sentential form. For q,r £ ( NI * U T)*, we write 
q =>g f if there are words qi,q -2 £ (NI* U T)*, A £ N, p £ (N U T)* and x,y £ I* 
such that q — qiAxq 2 , r = qi[p, y]q 2 , and one of the following is true: 

(i) A —> p is in P, p £ (N U T)* \ T *, and y = x, 

(ii) A —> p is in P, p £ T*, and y = x = e, 

(iii) A —> pf is in P and y = fx, or 

(iv) Af -A p is in P and x = fy. 

The language generated by G is L(G) = {w £ T* \ S w}, where =>q denotes 
the reflexive transitive closure of =>g- 

We will often assume that our indexed grammars are in normal form , which 
means that every production is in one of the following forms: 

(i) A — > Bf , (ii) Af —> B , (iii) A —> uBv , (iv) A BC, (v) A —>• w, 

with A,B,C £ N, f £ /, and u,v,w £ T*. Productions of these forms are called 
push , pop , output , split, and terminal productions, respectively. The normal form 
can be attained just like the Chomsky normal form of context-free grammars. 

Example 8. Let G = (N, T, /, P, S) be the indexed grammar with N = {P, T, A , B}, 
T = {a,b}, I = {f,g}, and the productions 

S->Sf, S -+Sg, S -> UU, U^e, 

Uf -s- A, Ug —>• B, A -»• Ua, B -> Ub. 

Then it is easy to see that L(G) = {ww \ w £ {a, b}*}. 

J We require that a nonterminal can only be replaced by a terminal word if it has no index 
attached to it. It is easy to see that this leads to the same languages [331. 
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Derivation trees are always unranked trees with labels in NI* UTU {e} and a 
very straightforward analog to those of context-free grammars. If t is a labeled tree, 
then its yield , denoted yield (t), is the word spelled by the labels of its leaves. For 
an example for the grammar from Example see Figure [l|. 


Overview. The SUP for indexed grammars does not seem to easily reduce to a 
decidable problem. In the case L C a*, the SUP is just the finiteness problem, 
for which Hayashi presented a procedure using his pumping lemma 118]. However, 
neither Hayashi’s nor any of the other pumping or shrinking lemmas )l2l . 24 . mm 
appears to yield decidability of the SUP. Therefore, this work employs a different 
approach: Given an indexed grammar G with L(G) C a{ ■ ■ ■ a* , we apply a se ries 
of tran sformations, each preserving the simultaneous unboundedness (sections l3.3l 
to 13.7tl . These transformations leave us with an indexed grammar in which the 
number of nonterminals appearing in sentential forms is bounded. This al lows us 
to construct an equivalent finite-index scattered context grammar ('section 13.81) . a 
type of grammars that is known to exhibit effectively semilinear Parikh images. 


An undecidability result. Before proving decidability of the SUP for indexed 
languages, we prove Proposition 0. 

Proof of Proposition Q. We provide a reduction from the Post correspondence prob¬ 
lem (PCP), which asks, given an alphabet X and morphisms a,/3: X* —» {1,2}*, 
whether there is a w (E X + with a(w) = (3(w). It is well-known that this problem 
is undecidable j30| . 

For a word w £ {1,2}*, let v{w) G N be the number obtained by interpreting 
the word w as a 2-adic representation. This means, for w £ {1, 2}*, we have 

v(e) = 0, v{wl) = 2 • v(w) + 1, v(w2) = 2 • i/(w) + 2. 
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Given an alphabet X and two morphisms a,/3: X* —> {1,2}*, we shall construct 
an indexed grammar G with 

(2) L(G) = { o K«(w)V(0(«O) | w e x+}. 

Note that this establishes the proposition: Since the map v. {1, 2}* —>• N is injective, 
the equation 0) implies that L(G) (~l {a n b n \ n > 0} ^ 0 if and only if there is a 
w g X + with a(w) = (3(w). 

For the sake of simplicity of the other proofs, our definition of indexed gram¬ 
mars restricts the syntax of productions. To make the the description of G more 
convenient, we allow one more case as a shorthand: In the following, when we write 
Ax o —> Bx i ■ ■ ■ x n for nonterminals A, B and index symbols xq, ... ,x n , then this 
represents n + 2 productions, Ax o —» Z n . Zi —> Zi_\Xi for 1 < i < n, and Zq —>• B , 
where Zq, ..., Z n are nonterminals occurring nowhere else. 

The grammar G has nonterminals S, U, A, A, B, B (and those resulting from using 
shorthands), index symbols I = X U {1, 2} (we assume that X fi {1, 2} = 0), and 
terminals T = {a, b}. The first set of productions allows us to derive all sentential 
forms AwBw for w g X + . For each x g X, we have: 

S -+Ux, U -+Ux, U -> AB. 

We also have productions that allow the nonterminals A, A (B,B) to replace an 
index symbol x with a(x) (/3(x)): 

(3) Cx —> Ca(x) for each x € X and G g {A, A}, 

(4) Cx —t C(3(x) for each x € X and G G {B, B}. 

These guarantee that for every w € X* and G g {A, A}, the sentential forms 
Cw and Ca(w) derive the same terminal words. Analogously, for w g X* and 
G g {B,B}, the sentential forms Cw and C(3(w) derive the same terminal words. 

Together with |s|) and (0), the next set of productions turns the sentential form 
Aw and Bw into a u ^ w ^ and b u ^^ w ' ) \ respectively: For each G g {A, B }, we have 

(5) Gl^GG, G2 -4- GGG, Cd^CC. 

Finally, to obtain terminal words, we have the productions 

(6) A —>■ e A —>■ a, B —>• e, B —> b. 

It remains to be shown that our grammar meets the goal in eq. 0). Because of 
the productions 0) and 0), it suffices to show that for w g {1,2}*, the sentential 
form Aw (Bw) derives precisely one terminal word, namely a (feG^O))). p 0 r 
symmetry reasons, we only show that Aw derives precisely the word a^(“( w )) i 
We proceed by induction and strengthen the statement slightly. Namely, we 
claim that for wo, ■ ■ ■, w n g {1, 2}*, the sentential form AwqAwi ■ ■ ■ Aw n derives 
precisely the word a u ( w o)+™^ where m = 2^^ ■ We use noetherian induc¬ 

tion with respect to the set of finite sequences of natural numbers, ordered lex¬ 
icographically, and to the words wo,...,w n g {1,2}*, we assign the sequence 
(|u!o|,..., |in n |). Now the induction step is just the observation that for every deriva¬ 
tion step AwqAwi ■ ■ ■ Aw n =>- AzqAz\ ■ ■ ■ Az k with wq, ..., w n , Zo, ..., Zk g {1, 2}*, 
we have (|zo|, ■ • •, \zk\) < (|«4)|, ■ • ■ > l^nl) in the lexicographical order and also 

n k 

^o) + E 2KI = ^o) + E 2NI - 
2=1 2=1 
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This can be seen by inspecting the productions (0) and noticing that for all words 
w £ {1,2}*, we have 

v{lw) = 2H + v{w), is{2w) = 2 • 2 h + v(w). 

Moreover, the claim is true in the case vjq = ■ ■ ■ = w n = e. Indeed, the sentential 
form AA n can only derive the word a n and n = i /(e) + Y^i =i 2°. □ 


3.1. Triple construction. We begin with the triple construction, a standard tool 
in the theory of grammars that we will use on several occasions. Suppose we have an 
indexed grammar G = (N, T, /, P, S ) in normal form and a finite-state transducer 
A = ( Q , T, X, E, qo , F). The triple construction is usually employed to prove closure 
under rational transductions, i.e. to build a grammar Ga = {Na, T, I, Pa, Sa ) such 
that L(Ga) = V'L(G), where V = T(A). More precisely, Na consists of all triples 
(p, B, q) with p,q £ Q and B £ N and they satisfy: 

(7) (p, P, q)x =tg A y if and only if 3z £ T* : Bx =>£ z, p z ’ v y q. 

For the construction, we assume that the edges in A are all of the form (p, t, e, q ) 
or (p,e,x,q) for p, q £ Q 1 t £ T, x G X. Furthermore, we assume that A has only 
one final state. Suppose p,q £ Q, B £ N, and consider the languages 

= { v e X* | 3B -A u G P, uef, pAA+q in A}. 

Since each of these sets is regular, we can construct an automaton U with state set 
Q such that for each p,q € Q and B £ N, there are states r p ,B,q, s p ,B,q £ Q with 

r P ,B,q s Pt B,q in U if and only if w £ T p ,b, 9 


for w £ X*. We are now ready to describe the grammar Ga- Its set of nonterminals 
is Na = (Q x N x Q) U (Q x Q). The first type of productions are the following. 
For r, s £ Q and each edge r 4 f in P, we have a production (r, s) —>• x(t,s). 
Furthermore, for each s £ Q, we have the production (s, s ) —» e. Since these will 
be the only productions with left-hand side in Q x Q. we will have 


(r, s) =>*g a w if and only if r -—>■ s in U 

for r,s £ Q and w £ X*. For p,q,r £ Q , B,C : D £ N, g £ I, u,v £ T *, Ga has 
productions 


(p, B, q) (p, C, q)g 
(p,B,q)g -> (p,C,q) 

(p,B,q) -> u(p',C, q')v 

(p,B,q) (p, C,r)(r, D, q) 


for each P —»• Gp £ P, 

for each Pp -A G £ P, 

for each P -4- uGu £ P, p', q £ Q 

with p — > p , g — y g in A 

for each P —>• GP £ P. 


Moreover, it has the production (p, P,p) —>• (r Pt B,q, s Pt B,q) for each p,q £ Q and 
B £ N. Now it is easy to verify that eq. 0) is satisfied. Therefore, we let (q 0 , S, f) 
be the start symbol of Ga, where qo and / are the initial and the final state, 
respectively, of A. Then, in particular, we have L(Ga) = VL (G), where V = T(A). 
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3.2. Regular index sets. We now analyze the structure of index words that fa¬ 
cilitate certain derivations. Let G = (N, T, P, S ) be an indexed grammar, A £ N 
a nonterminal and R C T* a regular language. We write IW G (H, R) for the set of 
index words that allow A the derivation of a word from R. This means 

IW g (A,P) = {x £ r I 3y £ R: Ax =>* G v}- 

The following lemma is essentially equivalent to the fact that the set of stack con¬ 
tents from which an alternating pushdown system can reach a final configuration 
is regular [H]. We include here a proof in the terminology of indexed grammars. 

Lemma 9. For an indexed grammar G, a nonterminal A, and a regular language 
R, the language IW G (H, R) is effectively regular. 

Proof. Let G = (N, T, I , P, S). First of all, we may assume that G is in normal form, 
since bringing an indexed grammar in normal form does not affect the languages 
\\Na(A, R). Suppose A! is a transducer such that for some states p,q, we have 
p z ' v y q in A' if and only if z = y and z £ R. In particular, IW g (j4, R) equals 
IWg a , ((p, A,q),T*), where Ga 1 is obtained using the triple construction. Therefore, 
it means no loss of generality to assume that R = T*. Hence, we may discard the 
generated terminals and assume that in G, every production is of the form B —> Cf, 
Bf —> C, or B —> w with w £ N*. 

Our next step is to construct a grammar G' with the same nonterminal and index 
symbols as G such that (i) \\Ng'(A,R) = \\Nq(A,R) and (ii) if Aw => G , e, then e 
can be derived from Aw without using push productions. We do this by successively 
computing grammars Gi = (N, T, I, Pi, S) for i £ N such that Po C Pi C • • •. We 
initialize Po = P. Suppose Gi is already defined and that every productions in Pi 
is of the form B —> Cf, Bf —> C, or B —> w for some w £ N*. In the following, we 
say that a production is a nonterminal production if it is of the form B —> w with 
B £ N and w £ N + . Consider the language 

K® = {w £ N + | B «;}, 

where => G . is the restricted derivation relation that only permits nonterminal pro- 
ductions. Then K B is clearly context-free. Hence, we know that also the language 
Lp f = context-free, where Vf is the rational transduction that on in¬ 

put w = Bi ■ ■ ■ Bk, outputs all words C\ ■ ■ ■ Ck for which there are productions 
Bjf —> Cj in Pi for 1 < j < k. Observe that consists of all those sentential 
forms of Gi reachable from Bf, B £ N, f £ I, by first applying only nonterminal 
productions and then applying at each position a production that pops /: Since 
only allows nonterminal productions, the production sequence for w £ is 
applicable to Bf as well. (Recall that our definition of indexed grammars forbids 
the application of terminal productions to nonterminals with a non-empty index.) 

The context-freeness of L^/ a U° ws us to compute the set W B ^ C 2 N with 

W B j = {alph(w) | w £ L^b f}- 

The set W B j describes all combinations of nonterminals that can result when 
applying to Bf a number of nonterminal productions and then at each position a 
production popping /. We are ready to describe the productions in Pi+i- For each 
subset X C TV, we pick a word wx £ N* with alph(wx) = X and \wx\ = l-X’l- 
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We obtain P,+i by adding to Pi the production G —>• wx for each C € N and 
X £ such that C —> Bf £ Pi with B £ N, f £ I. 

Note that since we only add productions, we have I( A , R) C IWc i+1 {A, R) 
and the construction guarantees IWg 4 (A, R) = IW(j 4+1 (A, R): Since the added wx 

(i) 

contains all the nonterminals of the corresponding word in Lr B p a derivation of e 
in G i+ i can easily be turned into a derivation in G.; by replicating subtrees in the 
derivation tree. 

Since we only add productions of the form C —> w with |ie| < \N\, there must 
come an i with Pj + i = Pi. This means that for each C £ N and each u £ L B f 
such that C -A Bf £ Pi, we have some production C —>• w with alph(w) = alph(u). 
Therefore, in Gi, for A £ N, v £ I*, the sentential form Av can derive e if and 
only if it can do so without using push productions: For each derivation of e 
from Av that uses a push production, we can bypass this push production and all 
corresponding pop productions by using one of the added productions C —>• Wx- 
Thus, a derivation of e with a minimal number of occurrences of push productions 
has to avoid them altogether. 

This allows us to construct a finite automaton for IWg^A, P) rev . Here, for 
a language U C X*, U rev denotes the set of words from U in reverse. As the 
automaton reads index words from right to left, it maintains the set of nonterminals 
B for which the currently read suffix v satisfies Bv =>q. £ - The set of states of our 
automaton is therefore the power set of N and its initial state is 

qo = {B€N\B =*►'£ £}, 

where =>q. is the restricted derivation relation that only permits productions with 
a left-hand side in N and a right-hand side in N*. As transitions, the automaton 
has for every X C N and / £ I an edge 

X ———> {B £ N | W^ f C X} 

Note that we can again compute the initial state of the automaton using context- 
freeness arguments. The final states of the automaton are all those sets X C N that 
contain A. Then, the automaton clearly accepts IWg 4 (A, P) rev = IWg(A, P) rev . □ 

3.3. Interval grammars. We want to make sure that each nonterminal can only 
derive words in some fixed ‘interval’ a* ■ ■ ■ a*. An interval grammar is an indexed 
grammar G = (N,T, I, P, S) in normal form together with a map l: N —> N x N, 
called interval map, such that for each A £ N with l(A) = ( i,j ), we have 

(i) 1 <i<j<n, 

(ii) if Ax =>q u for x £ I* and u £ T*, then u £ a* • • • a*, and 

(iii) if S uAxvByw with u,v,w £ (NI* LIT)*, B £ N, x,y £ I*, and 
i(B) = ( k,i ), then j < k. 

Proposition 10. For each indexed grammar G with L(G) C ■ ■ ■ a*, there is an 
equivalent interval grammar. 

Proof. Let G = (N,T, I, P, S) be an indexed grammar in normal form such that 
L(G) C a* • • • a*. Our new grammar has nonterminals 

N' = {(*, A, j) | A £ N, 1 <i<j< n} 
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and productions 

(■for each Af ->■ B £ P, 

(i, A j) —)► (for each A^ Bf £ P, 

(i , A, j) —»• u(r, B, s)v for each A —» uBv £ P, i < r < s < j, 

u £ a* ■■■ a* and v £ a* ■ ■ ■ a *, 

(i, A,j) —> (i, B, k)(k, C,j ) for each A —> SC € P and i < k < j, 

(i , A, j) —>• w for each A —>■ w £ P with w £ a* ■ ■ ■ a* 

where A, B 7 C £ N and f £ I. As the new start symbol, we choose (1,5, n). Then, 
setting t((i,A,j)) = ( i,j) for each A £ N and i,j £ N clearly yields an interval 
grammar and its equivalence to G is easily verified. □ 


3.4. Productive grammars. We will also need our grammar to be ‘productive’, 
meaning that every derivable sentential form and every nonterminal in it contribute 
to the derived terminal words. A production is called erasing if its right-hand side 
is the empty word. A grammar is non-erasing if it contains no erasing productions. 
Moreover, a word u £ (NI* UT)* is productive if there is some v £ T* with u =>q v. 
We call an indexed grammar G productive if (i) it is non-erasing and (ii) whenever 
u £ ( NI*UT)* is productive and u u' for u' £ (NI*UT)* , then u' is productive 
as well. The following proposition is shown in two steps. First, we construct an 
interval grammar and then use Lemma @ to encode information about the current 
index word in each nonterminal. This information is then used, among other things, 
to prevent the application of productions that lead to non-productive sentential 
forms. The proposition clearly implies that the SUP for indexed grammars can be 
reduced to the case of productive interval grammars. 


Proposition 11. For each indexed grammar G with L(G) C one can 

construct a productive interval grammar G' with L(G') = L(G) \ {s}. 


The proof of Proposition [llj relies on a construction that is used again for the 
proof of Lemma [T~il. so we describe it for general indexed grammars. 

By Lernma^, the languages IWq(A, T + ) and IWg(A, {e}) are effectively regular. 
This means, we can construct a deterministic finite automaton that reads a word 
over I in reverse and, after reading the suffix u £ /*, maintains in its state the set 
of all nonterminals A with u £ IWg(A,T + ) as well as the set of those A for which 
u £ IW g (A, {e}). 

Let us formalize this. There is a finite set Q, an element qo £ Q, maps 
<7o> cr+ : Q —> N , and a map •: I x Q —> Q such that if we extend the latter map to 
•: I* x Q —» Q via ua ■ q = u ■ (a ■ q) and e • q = q for a £ I , u £ I* , q £ Q , then 


(u ■ qo) = {A £ N \ Au =^ G e}, 
cr+(it ■ qo) = {A £ N | 3i> £ T + : Au => G u} 


for each u £ I*. 

The idea behind the construction of G is to encode into each nonterminal the 
state in Q reached by reading its current index. Hence, as nonterminals, we have 
the set N = N x Q. In order to be able to update this state, we also need to 
encode such states into the index words themselves. Here, each index symbol 
will encode the state reached by reading the suffix to its right. Thus, the index 
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symbols in G are / = I x Q. Formally, we want to achieve the following. Let 
g: ( NI * U T)* —> ( NI* U T)* be the function with 

y(Afn ' ' ' /l) = (A, Qn)(fm Qn—l) ' ' ' (/lj 9o)j 
for fi £ I, where q, = /, • • • fiq 0 for 1 <i <n and 

g(u 0 AiWiu,i ■ ■ • AmWmUm) = u 0 g(A 1 w 1 )u 1 ■ ■ ■ g(A m w m )u m 

for uo,...,u m e T *, Ai,...,A m e N , e I*. Then, we want the 

grammar G to satisfy 


( 8 ) 


Aw v if and only if g(Aiu) =>A v 


for A e N, w e /*, and v e T + . G has the productions 


(A q)(f, q') —>■ {B,q’) 

(A, q) -t (B,f-q)(f,q) 
(A, q) -t u{B,q)v 
{A,q) ( B,q) 

(A, q) —> (C, q) 

{A,q) ( B,q)(C,q ) 

(A <7o) u 


for each Af -A B e P with B e <J + (q'), q = f ■ q', 

for each A —> Bf e P with B e cr + (/ • q), 

for each A —> uBv e P with B e cr +(q), 

for each A —*■ BC £ P with B € cr + (q), C £ cr 0 (g), 

for each A —> BC £ P with B £ cro(q), C £ cr+(q), 

for each A BC £ P with B £ cr+(q), C £ cr+(q), 

for each A —► u £ P with u £ T + . 


Furthermore, the start symbol of G is (S, qo) = g(S). Each of the directions of 
eq. (0) now follows by induction on the number of derivation steps. Hence, we have 
L(G) = L(G) \ {e}. Let us prove that G is productive. Suppose the partial function 
h: (NI* U T)* —> (NI* U T)* is defined as the restriction of g to those words 


X = UoAxWxU! ■ ■ ■ A m w m u m , 


(with uq, ..., u m £ T* , Hi,..., A m £ N, w i, ..., w m £ I*) where for each index 
i £ {1,... ,m}, the set cr+(wj • qo) contains A;. Then it follows from eq. (@) that 
every u £ \mh is productive. Furthermore, by induction on n, one can show that 
u =>% v, v £ T + , implies u £ im/i. Thus, u is productive in G if and only if 
u £ im h. Moreover, an inspection of the productions in G reveals that if u =>£ v 
and u £ imft., then v £ im h. Thus, if u £ (NI* U T)* is productive, then every 
sentential form reachable from u is productive. Hence, G is productive. 

Proof of Proposition [7^. Using Proposition [ToL we construct an interval grammar 
H with L (H) = L(G). Let H = (N,T, I, P, S) be obtained from H as above. Then 
H is productive and generates L (H) = L (H) \ {e}. We define t: N —> N x N by 
t((A, q)) = l(A). Then H, together with l, is clearly a productive interval grammar 
with 1(H) = L(G) \ {e}. □ 


3.5. Partitioned grammars. Our next step is based on the following observation. 
Roughly speaking, in an interval grammar, in order to generate an unbounded 
number of a, ! s, there have to be derivation trees that contain either 

(i) an unbounded number of incomparable (with respect to the subtree order¬ 
ing) cij-subtrees (i.e. subtrees with yield in a* ) or 

(ii) a bounded number of such subtrees that themselves have arbitrarily large 
yields. 
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In a partitioned grammar, we designate for each a*, whether we allow arbitrarily 
many aj-subtrees (each of which then only contains a single af) or we allow exactly 
one (^-subtree (which is then permitted to be arbitrarily large). The symbols of 
the former kind will be dubbed ‘direct’. 

Let us formalize this. A nonterminal A in an interval grammar is called unary 
if i(A) = ( i,i ) for some 1 < i < n. A partitioned grammar is an interval grammar 
G = (N, T , I, P, S) , with interval map i\ N ->NxN, together with a subset D CT 
of direct symbols such that for each a* £ T, the following holds: (i) If aj £ D, then 
there is no A £ N with i(A) = (i, i), and (ii) if a* ^ D and t is a derivation tree of 
G, then all occurrences of a,; are contained in a single subtree whose root contains 
a unary nonterminal. In other words, direct symbols are never produced through 
unary nonterminals, but always directly through non-unary ones. If, on the other 
hand, a* is not direct, then all occurrences of a* stem from one occurrence of a 
suitable unary symbol. The next proposition clearly reduces the SUP for indexed 
grammars to the case of partitioned grammars. 

Proposition 12. Let G be a productive interval grammar with L(G) C a* ■ ■ - a*. 
Then, one can construct partitioned grammars G i,..., G m such that L(G)J, equals 
a* • • • a* if and only if L(Gi)4_ = al ■ ■ • a* for some 1 < i < m. 


Proof. Suppose G is a productive interval grammar. We prove the proposition by 
constructing for each subset DCTa partitioned grammar Gd and then show that 
L(G)J_ = a* • • • a* if and only if L(Gd)4- = a\ • • • a* for some D CT. Observe that 
for n = 1, G is already a partitioned grammar with D = 0. Therefore, we may 
assume that n > 2. 

Since G is a productive interval grammar, we may assume that each of its pro¬ 
ductions is in one of the following forms: 

(i) A —x Bf with l(A) = ( i,j ), i(B) = (r, s) and i <r < s < j, 

(ii) Af —x B with i(A) = {i,j), i{B) = (r, s) and i < r < s < j, 

(iii) A —x uBv with l(A) = ( i,j), l(B) = (r, s), u £ a* ■ ■ ■ a*, v £ a* ■ ■ ■ a*, 

(iv) A —x BC with l(A) = (■ i,j ), i(B) = ( p,q ), l(C) = (r, s) and i < j < p < 


q < r < s < j, 

(v) A —> u with l(A) = ( i,j ) and u £ a* ■■■ a* 
with A,B,C £ N. A production that is not of thi s for m that is used in a derivation 
would allow the grammar to violate condition item lfiin of interval grammars. Hence, 
every production that is not in one of these forms can be safely removed. By 
introducing new intermediate nonterminals, we can therefore even assume that 
every production is in one the following forms: 

(i) A —> Bf with t(A) = l(B), 

(ii) Af —x B with i(A ) = t{B), 

(iii) A —x uBv with t(A) = (i,j ), l(B) = (r, s), u £ a* ■ ■ ■ a*, v £ a* ■ ■ ■ a*, 

(iv) A —> BC with l(A) = i(B) = ( p,q ), i(C) = (r, s) and i < j < p < 


q < r < s < j, 

(v) d->!i with t(A) = ( i,j ) and u £ a* ■■■ a* 

Suppose D CT. First, we construct the grammar G' D from G. Here, the essential 
idea is to replace each maximal subtree whose root has a label Ax with A £ N, 
x £ /*, i(A) = (*,*) and a* £ D by a single node labeled a*. The resulting trees 
are the derivation trees of G' D , which then has no nonterminals A with i(A) = (*, i) 
and ai £ D. 
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Because of our normal form, whenever a unary nonterminal is introduced in G 
that does not already stem from a nonterminal with the same /-value, the left-hand 
side of the production is a non-unary nonterminal. Hence, consider a production 
A —> w in G such that l(A) = ( i,j ) with i < j and w £ (NUT)*. Let w' £ (NUT)* 
be obtained from w by replacing each B £ N, l(B) = (k,k), a*, £ D , with the 
symbol a^. 

(i) If |w/|iv > 1, we add the production A —> w'. Note that then, A —»• w' 
can be applied whenever A —> w is applied (Recall that productions with a 
right-hand side in T* can only be applied when the index word is empty). 

(ii) If w' £ T*, the production A —> w' is not applicable when the A in the 
sentential form still carries a non-empty index word. In this case, we intro¬ 
duce a fresh nonterminal E, set l(E) = (i,j), and add productions A —»• E, 
Ef —> E for each f £ I, and E —>• w'. Then, whenever A —> w is applied, 
we can instead apply A —> E, then remove the index with Ef —> E , and 
finally apply E —>• w'. 

Moreover, we remove all nonterminals A with l(A) = (i,i), a,i £ D and all produc¬ 
tions containing such nonterminals 

Now in fact, the derivation trees of G' D are precisely those obtained from deriva¬ 
tions trees t of G by replacing every maximal subtree whose root is labeled Ax. 
A £ N, x £ I*, i(A) = (i,i), a,i £ .D, with a node labeled Oj and, if necessary, 
adding a path of productions Ef —> E. 

Note that since G is productive and l(A) = (i,i), every word derivable from A 
(together with an index word) is contained in af. Furthermore, every occurrence 
of A in a derivable sentential form of G is also able to derive a word in af. This 
means, for each u £ \-(G' D ), there is a word v £ L(G) with u A v. Hence, we have 
L(G' D )| c L(G);. 

Consider a derivation tree of G' D or of G. We call a node i-node if its label is a* 
or some A £ N with i(A) = (i,i). If, in addition, the f-node has no ?'-node as an 
ancestor, it is an i-root. A subtree whose root node is an i-root of the derivation 
tree is called i-subtree. 

As a second step, we construct Gd from G' D so that the following holds: The 
derivation trees of Gd are precisely those obtained from derivation trees of G' D by 
essentially deleting for each a* £ T\D all but one '/'-subtree (‘essentially’ because we 
have to rename the remaining nonterminals). Of course, if the deletion of subtrees 
leaves behind a leaf labeled with a nonterminal, we attach an e-labeled node below 
it. The construction of Gd is achieved by letting each nonterminal carry a function 
a: T \ D —> {0,1 ,u/}. Here, a(ai) = 1 indicates that the one allowed i-subtree is 
somewhere below the current node; a(a*) = 0 means that the i-subtree is located 
elsewhere in the derivation tree; and a(a,i) = ui indicates that the current node is 
part of the i-subtree. In particular, the new start symbol carries the function a 
with a(a,i) = 1 every a* £ T \ D. It is easy to adjust the productions to use and 
update these functions a. 

Now, every word in L(Gd) is obtained from a word in L (G' D ) by deleting for 
each a, £ T\ D the yields of all but one i-subtree. Hence, for each u £ L(Gd), 
there is a v £ L (G' D ) with u A v. Thus, we have L(GdH ^ L(G^)j, C L(G)),. This 
means, if L(GdH = a* • • • a*, then L(G)4- = a* • • • a*. It remains to be shown that 
if L(G)j, = a\ ■ ■ ■ a*, then there is some D C T with L(Gd) 4- = a* • • • a*. 
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Suppose L(G){. = a* •••a*. Then there is a sequence i 1 , * 2 , • - - of derivation 
trees of G such that a\-- -a* A yield (tk)- For each derivation tree t, let <Ti(t) be 
the number of i-subtrees in t. By Dickson’s Lemma, we can pick a subsequence 
tj,tl,, • ■ • of such that for 1 < i < n, (Ji is monotonically increasing on 

We claim that with 

D = {at £ T | ijj is unbounded on t[, t ' 2 ,...}, 

the grammar Gjj satisfies L(Gd){ = a* •••a*. By definition of D, we can find 

a subsequence t ", t "■... of ■ ■ ■ such that &i(t k ) > k for every as* £ D. Let 

Sk = t k be the derivation tree of G' D corresponding to t k of G as above. Then 
/ *// \ 

we have a\ A k> < yield(sfc) for Oj G D. Since we only change ^-subtrees for 
Oj £ D when going from t k to Sk, we still have a\ < yield(s*,) for a* £ T\D and 
thus a\---a^ A yield (s^). 

The choice of D guarantees that <r r is bounded on si, S 2 , ■ ■ ■ for every at £T\D. 
Hence, there is an i £ N with <Ti{sk) < l for every k £ N and a,; £ T\D. This means, 
if Ti (t) is the maximal length of a yield of an i-subtree of t, then r, is unbounded on 
Si, S 2 , ■ ■ • for each aj £ T\D. Indeed, if t.\ were bounded on s\, S 2 , ■ ■ ■ by B £ N, then 
yield(sfc) would contain at most £-B occurrences of eq for at £T\D , contradicting 
a\ -< yield(sfe). We can therefore hnd a subsequence s^, s ' 2 ,... of Si, S 2 ,... such that 
Ti(s ' k ) > k for <n £ T\D. Note that since this is a subsequence of Si,S 2 , • •it 
automatically satisfies a\ yield(s^.) for Oj £ D. 

Let us now turn the trees s' 2 , ■ ■ ■ into derivation trees ri, r 2 ,... of Gd- We do 
this by deleting, for each cq £ T\D, from s' k all i-subtrees but the one with the 
longest yield (and renaming the remaining nonterminals to obtain derivation trees 
of Gd). Again, if this deletion leaves behind a leaf labeled by a nonterminal, we 
attach an e-labeled node beneath it. Clearly, each r k is a derivation tree of Gd- 
Observe that a\- ■ ■ a 1 ^ A yield(r-fc). Indeed, if a* £ D , then a\ A yield(s' fc ) and thus 
a* A yield (r-fc). If a, £T\D, then Ti{s' k ) > k and hence a\ A yield(r-fc). Therefore, 
we have L(Gd)| = «*••• a*. □ 

3.6. Constructing transducers. The last step in our proof ((section [.'{.8l l will be 
to solve the SUP in the case where we have a bound on the number of nonter¬ 
minals in reachable sentential forms. The only obstacle to such a bound are the 
unary nonterminals corresponding to terminals at £ D: All other nonterminals 
have l(A) = (i,j) with i < j and there can be at most n — 1 such symbols in a 
sentential form. However, for each ^ D , there is at most one subtree with a 
corresponding unary nonterminal at its root. Our strategy is therefore to replace 
these problematic subtrees so as to bound the nonterminals: Instead of unfolding 
the subtree generated from u £ NI*, we apply a transducer to u. 

In order to guarantee that the replacement does not affect whether L(G)), equals 
a* • • • a*, we employ a slight variant of the equivalence that gives rise to the cost 
functions of Colcombet [6|. If /: X —>■ N U {oo} is a partial function, we say that 
/ is unbounded on E C X if for each k £ N, there is some x £ E with f(x) > k 
(in particular, f(x) is defined). If g: X —>• N U {oo} is another partial function, 
we write / ss g if for each subset EC X, we have: / is unbounded on E if 
and only if g is unbounded on E. Note that if h: Y —> X is a partial function 
and f ~ g, then h o f « h o g. Now, we compare the transducer and the original 

2 The difference is that we have an equivalence on partial instead of total functions. 
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grammar on the basis of the following partial functions. Given an indexed grammar 
G = (N, T, I, P, S) and a transducer A with T (A ) C NI* x T*, we define the partial 
functions fa, f a '■ NI* ->NU { 00 } by 

fa{u) = sup{|u| | v £ T*, u =>g v}, 
f A (u ) = sup{|u| I V £ T*, (u,v) £ T(A)}. 

Note that here, sup M is undefined if M is the empty set. 

Proposition 13. Given an indexed grammar G, one can construct a finite-state 
transducer A such that fA~fG- 

Productivity. In the proof of Proposition!!^, we assume that G is productive. The 
following lemma justifies this. A partial function h: X* —> Y* is called rational if 

{{u,v)£X*xY*\f(u) = v} 

is a rational transduction. 

Lemma 14. Given an indexed grammar G, one can construct a productive gram¬ 
mar G' and a rational partial function h such that /gRi/io /g'. 

Proof. Consider the grammar G and the partial function h constructed in the proof 
of Proposition [HI. Since G is productive and h is clearly rational, it suffices to show 
that /io/ 6 «/ G . 

Note that h is defined on u G NI* if and only if there is some v G T + with 
u =>q v. This means, u G dom h if and only if /g(m) > 1. Furthermore, if 
u G dom h , then eq. (@) implies that /^(^(zi)) = /g(u). Hence ho fg and /g agree 
on dom h and are both bounded on NI*\dom h. This clearly implies hof^ ~ /g- □ 

Now it suffices indeed to prove Proposition [l3| for productive grammars: If we 
can construct a finite-state transducer A with /A « /g' and A' is the transducer 
that first computes h and then applies A , we have f a 1 = h o f a ~ h o fQi ze fQ. 
Hence, we assume that G is productive. 

The construction of the transducer will involve deciding the finiteness problem 
for indexed languages, which asks, given G, whether L(G) is finite. Its decidability 
has been shown by Rounds [3lJ (and later again by Hayashi [hI Corollary 5.1]). 

Theorem 15 (Rounds [31]). The finiteness problem for indexed languages is de¬ 
cidable. 

Let R = {Bw | B G N, w G /*, w G \\Ng(B,T*)}. Then /g is clearly undefined 
on words outside of R. Therefore, it suffices to exhibit a finite-state transducer A 
with /a|h ~ /g : The regularity of R means we can construct a transducer A' with 
f A ' = /a|r- In order to prove the relation f A \R ~ fa, w e employ the concept of 
shortcut trees. 

Shortcut trees. Note that since G is productive, the label £ does not occur in 
derivation trees for G. Let t be such a derivation tree. Let us inductively define 
the set of shortcut trees for t. Suppose t’s root r has the label £ G NI* U T. If 
£ € N LIT, then the only shortcut tree for t consists of just one node with label £. 
If l = Bfv, B £ N, f £ I, v £ I*, then the shortcut trees for t are obtained as 
follows. We choose a set U of nodes in t such that 

(i) each path from r to a leaf contains precisely one node in U , 

(ii) the label of each x £ U either equals Cv for some C G IV or belongs to T, 
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Figure 2. Example of a derivation tree and two possible shortcut 
trees. The trees above are two drawings of the same derivation 
tree. In each of the derivation trees, the boxed nodes induce nodes 
in the shortcut tree below it. The shading of each boxed node 
indicates the level of its corresponding node in the shortcut tree. 
Here, A, B, C, D are nonterminals, /, g , h are index symbols, and 
a, 6, c are terminals. 


(iii) each node on the path from r to any x £ U has a label of the form Cuv 
with C £ N and u £ I*. 

For each such choice of U = {xi, ... ,x„}, we take shortcut trees for the 

subtrees of Xi,... ,x n and create a new shortcut tree for t by attaching ti ,..., t n 
to a fresh root node. The root node carries the label B. This is how all shortcut 
trees for t are obtained. For an example of a shortcut tree for a derivation tree, see 
Figured. Note that every shortcut tree for t has height |^| — 1. We also call these 
shortcut trees from l. 

In other words, a shortcut tree is obtained by successively choosing a sentential 
form such that the topmost index symbol is removed, but the rest of the index is not 
touched. For example, the chosen sentential forms in Figured are Afg , CgbDgcAg , 
and aBbDcC on the left-hand side and Afg , aAgbDgcAg , and aBbBDcC on the 
right-hand side. Note that if t is a shortcut tree for a derivation tree t, then we 
have |yield(f)| < |yield(t)|. On the other hand, every derivation tree has a shortcut 
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tree with the same yield. Thus, if we define fa ■ NI* ->NU { 00 } by 

Ig{u) = sup{|yield(t)| | t is a shortcut tree from 11 } 

then we clearly have /g ~ /g- Therefore, in order to prove / a\r ~ /g 5 it suffices 
to show Ja\r ~ /g- Let us describe the transducer A. For B,C £ N and g £ /, 
consider the language Lb, 9 ,c = {w £ (N U T)* \ Bg w, |w|g > 1}- Here, =>' G 
denotes the restricted derivation relation that forbids terminal productions. Then 
Lb, 3 ,c is the set of words w(root(t)) for shortcut trees i of derivation trees from Bg 
(or, equivalently, Bgv with v £ I*) such that C occurs in w(root(t)). Here, root(t) 
denotes the root node of t and w(root(f)) is the word consisting of the labels of the 
root’s child nodes. Each Lb, 9} c belongs to the class of indexed languages, which 
is a full trio and has a decidable finiteness and emptiness problem. Hence, we can 
compute the following function, which will describe H’s output. Pick anaeT and 
define for each B,C £ N and g £ I: 

{ {a}* if LB, g ,c is infinite, 

{a} if L B , g ,c is finite and L B , g ,c H (N U T)- 2 ^ 0, 

{e} if Ls, g ,c 7 ^ 0 and Ls, g ,c Q N UT, 

0 if Ls, g ,c = 0- 

Note that for each B,C £ N and g £ I, precisely one of the conditions on the 
right holds. The transducer A has states {go} U N and edges (go, B, {er}, B) and 
{B, g. Out (B, g, C ), C) for each B,C £ N and g £ I. A's initial state is go and its 
final states are all those B £ N with B => G w for some w £ T*. Hence, the runs 
of A on a word Bw £ R correspond to paths (from root to leaf) in shortcut trees 
from Bw. Here, the productivity of the words in R guarantees that every run of A 
with input from R does in fact arise from a shortcut tree in this way. 

Suppose A performs a run on input Bw £ R, |io] = k, and produces the outputs 
a ni ,..., a nk in its k steps that read w. Then the definition of Out(-, •, •) guarantees 
that there is a shortcut tree t such that the run corresponds to a path in which the 
*-th node has at least rij + 1 children. In particular, t has at least ni + • • • + nu 
leaves. Therefore, we have / a{Bw ) < fc(Bw). 

It remains to be shown that if /g is unbounded on E C R, then /a is unbounded 
on E. For a tree t, let S(t) denote the maximal number of children of any node and 
let f3(t) denote the maximal number of branching nodes (i.e. those with at least two 
children) on any path from root to leaf. We use the following simple combinatorial 
fact, for which we do not provide a proof. 

Lemma 16. In a set of trees, the number of leaves is unbounded if and only if 5 is 
unbounded or /3 is unbounded. 

Suppose /g is unbounded on E C R. Then there is a sequence of shortcut trees 
from words in E such that |yield(ti)|, |yield(f 2 )|, • • ■ is unbounded. This 
means <5 or /3 is unbounded on t\,t 2 ,.-.- Note that if t is a shortcut tree from 
Bw £ R , then the path in t with /3(f) branching nodes gives rise to a run of A 
on Bw that outputs at least /3(f) symbols. Hence, }a{Bw) > /3(f). Thus, if /3 is 
unbounded on fi, f 2 ,..then f a is unbounded on E. 

Suppose 6 is unbounded on t\, ti, ■ ■ ■■ Let x be an inner node of a shortcut tree 
t. Then the subtree of x is also a shortcut tree, say of a derivation tree t from 
Bgw £ R with B £ N, g £ I, w £ I*. Moreover, x has a child node with a label 
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C £ N (otherwise, it would be a leaf of t). We say that (B,g,C) is a type of x 
(note that a node may have multiple types). Since S is unbounded on 
and there are only finitely many possible types, we can pick a type (B, g,C) and a 
subsequence t\, t' 2 , ■ ■■ such that each t’ k has an inner node Xk with at least k children 
and type ( B,g,C ). This means there are nodes of type ( B,g,C ) with arbitrarily 
large numbers of children and hence Ls, g ,c is infinite. We can therefore choose any 
t\ and a run of A that corresponds to a path involving x t . Since Lb,q,c is infinite, 
this run outputs {a}* in the step corresponding to X;. Moreover, this run reads 
a word in E and hence f a is unbounded on E. This proves Ja\r ~ /g and thus 
Proposition H3. 


3.7. Breadth-bounded grammars. A breadth-bounded, grammar is an indexed 
grammar, together with a bound k £ N, such that each of its reachable sentential 
forms contains at most k nonterminals. Proposition [13| allows us to prove the 
following. 


Proposition 17. Let G be a partitioned grammar with L(G') C a* •••a*. Then, 
one can construct a breadth-bounded grammar G' with L(G') C a* • • • a* such that 
L(G))_ = a\ ■ ■ - a* if and only if L(G')4- = a* • • • a*. 


The proof comprises two steps. First, we build a breadth-bounded grammar 
that, instead of unfolding the derivation trees below unary nonterminals, outputs 
their index words as terminal words, which results in a breadth-bounded grammar. 
Then, we apply our transducer from Proposition[l3| to the resulting subwords. Since 
the breadth-bounded grammars generate a full trio, the proposition follows. The 
former is a matter of inspecting the triple construction. 


Lemma 18. The languages generated by breadth-bounded grammars form a full 
trio. 


Proof. Let G be a breadth-bounded grammar and A be a finite-state transducer 
with V = T ( A ). In order to prove the lemma, we need to exhibit a breadth-bounded 
grammar that generates VL(G). Consider the grammar Ga resulting from the triple 
construction (see section 13.11 1. 

Since Ga generates VL(G), it suffices to show that Ga is breadth-bounded. This, 
however, follows directly from the breadth-boundedness of G: Every sentential form 
of Ga is obtained from a sentential form of G by replacing nonterminals B £ N by 
symbols ( p,B,q ) with p, q £ Q or by symbols (r, s) with r,s £ Q. Hence, if in G, 
every sentential form contains at most k nonterminals, this is also true of Ga- Q 

Let G = (N, T, I , P, S) be a partitioned grammar with direct symbols D C T. 
We will be interested in derivations where the unary nonterminals are not rewritten. 
Therefore, we have the derivation relation =>g,d, in which u =>g,d v if and only if 
u =>g v and the employed production does not replace a unary nonterminal. This 
allows us to define 

PL(G) = {w£ ( UI* UP)* | A =>fj tD w}, 

where U C N is the set of unary nonterminals. Note that since G is partitioned, 
all unary symbols A have c{A) = (i,i) with tq ^ D. 

Lemma 19. For each partitioned grammar G, one can construct a breadth-bounded 
grammar G' with L(G') = PL(G). 
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Proof. Suppose G = ( N , T, I, P, S) is a partitioned grammar with direct symbols 
D C T and with L(G) C a* • • • a*. Let U C N be the set of unary nonterminals. 
We will use the new terminal symbols I = {f \ f € 1} and U = {A \ A € U}. If 
h: (U U / U T)* —> (U U I U T)* is the morphism such that h(a) = a for a £ T 
and h(x) = x for x £ U U I, then it clearly suffices to construct a breadth-bounded 
grammar G' with L(G') = h(PL(G)). Hence, our grammar will be of the form 
G' = (N',UU JUT, I , P',S). 

The new set of nonterminals is N 1 = N U {Zj for some fresh symbol Z. The 
productions of G' are obtained as follows. First, we remove from G all productions 
where the nonterminal on the left-hand side is in U. Then, we add for each A £ U 
the production A —»• AZ and for each / £ I the production Zf —> fZ and Z —)■ e. 
Hence, the new productions just output the nonterminal and then the index word 
(over a disjoint alphabet). 

It remains to be shown that G' is breadth-bounded. Let u be a sentential form 
of G'. Since G is partitioned, we have |m|c/u{z} < n - Moreover, there is a sentential 
form v of G with |v|jv\i/ = |it|iv\iy- Suppose A ±,..., A m are the non-unary nonter¬ 
minals in v. Since G is an interval grammar, we have t (Ai) = (ly, Sj) for 1 < i < m 
such that 1 < ri, s m < n, 7"i < Sj for 1 < z < m, and s,- < ?y + i for 1 < i < to. This 
implies m < n. Therefore, |zt|jv\i/ < Mat\u < ti. Hence, we have |u|jv' < 2 n. This 
proves that G' is breadth-bounded. □ 


We are now ready to prove Proposition [TtI. 


Proof of Proposition Q3- Suppose G = (N, T, J, P, S) and D C T is the set of direct 
symbols. Without loss of generality, we assume that T = {ai,..., a„. j and that for 
some m < n, we have D = {ai,..., a m }. First, we use Lemma 1 13 to construct a 
breadth-bounded grammar G" with L(G") = PL(G). This means L(G") consists of 
words 


^1 * * * ^rn B1W1 • • • 

where Bi £ N , Wi £ I*, and z(Pi) = (m + i,m + i) for 1 < i < n — m. 

Let A be the transducer provided by Proposition [l3| with ~ fa- We may 
clearly assume that A always outputs words in a* for some a € T. From A, we 
construct the transducer A' that, on the input word 


* * * Q’rnf B\W\ • • • Bn—rnWn—mt 


Pi,..., B n _ m € N, wi,, Wn-m £ I*, outputs all those words 


“1 ■ ■ ■ a m a m +1 

for which ( B t Wi , a Vi ) £ T(A) for 1 < i < n — m. 

Let V = T(A'). According to Lemma Hil. we can compute a breadth-bounded 
grammar G' for V(PL(G)) = V(L(G")). We claim that L(G')j. = a* ■ ■ - a* if and 
only if L(G')4_ = a\ ■ ■ ■ a*. 

Suppose L(G)j_ = a\ ■ ■ ■ a*. Then for each k £ N, there is a word ai 1,fc • • • aff” k 
in L(G) such that Xi : k > k. This means, there are words 

ai 1,fe • • • affT' k Bi,kWi,k -B n—m,k'Wn—m,k £ PL(G) 


■ - a. 


Vn—n 


such that Xi t k > k for 1 < i < m and fG(Bi,k w i,k ) ^ fc for 1 < i < n — m. Since 
Ia ~ /gj this sequence of words has a subsequence 


X 

Oi 


l,fc 


1 * CLm 


c TDI 

^1, 


k w l.k 


r)f / 

’ ^n—m.kr^n—r 


k 
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such that f A (B[ k w[ k ) > k for each k € N. Since this is a subsequence, we still 
have x\ k > k for 1 < i < m and fc ( k w[ k ) > k for 2 < i < n — m. If we repeat 
this picking of subsequences another n — m — 1 times, we arrive at a sequence of 
words 

• • • drn • • • B n — m ^W n — 

in which Xi t k > k for 1 < i < m and fA(Bi t kW n - m ,k) > k for 1 < i < n — m. By 
definition of G', this yields words 

n Sl ’ k . .. n Sm ’ k n Vl ’ k . . c I /f'l 

&ra ^ m -\-1 O ' 71 ) 

such that y h k > k for 1 < i < n — m. Hence, L(G')i = a* • • • a*. It can be shown 
completely analogously that L(G')4- = a* • • • a* implies L(G)| = a* • • ■ a*. □ 

3.8. Semilinearity. We have thus reduced the SUP for indexed grammars to the 
special case of breadth-bounded grammars. The last step in our proof is to prove 
the following. It clearly implies decidability of the SUP. 

Proposition 20. Languages generated by breadth-bounded grammars have effec¬ 
tively semilinear Parikh images. 

The basic idea of Proposition is to use a decomposition of derivation trees 
into a bounded number of ‘slices’, which are edge sequences of either (i) only push 
and output productions (‘positive slice’) or (ii) only pop and output productions 
(‘negative slice’). Furthermore, there is a relation between slices such that the index 
symbols that are pushed in a positive slice are popped precisely in those negative 
slices related to it. One can then mimic the grammar by simulating each positive 
slice in lockstep with all its related negative slices. This leads to a ‘finite index 
scattered context grammar’. This type of grammars is well known to guarantee 
effectively semilinear Parikh images [9j. 

Suppose G is a breadth-bounded indexed grammar. Since G can be brought into 
normal form while preserving the property of breadth-boundedness, we assume G 
to be in normal form. Let t be a derivation tree for G. An edge in t that connects 
nodes x and y is called chain edge if x and y both have a label in NI* and y is the 
only child of x that has a label in NI*. A non-empty sequence of chain edges that 
forms a path is called a chain. A maximal chain (i.e. that cannot be prolonged on 
either side) is called a segment. An edge in t corresponding to a push, pop, or output 
production is called a push edge , pop edge, or output edge, respectively. A chain 
that contains only push and output edges is called a (positive) phase. Similarly, if a 
chain contains only pop and output edges, it is a (negative) phase. For an example 
of a derivation tree and the decomposition into segments and phases, see Figure 0. 
The figure also shows an arrow collection and the decomposition of phases into 
slices; these concepts will be defined later. 

We call a segment two-phased if it consists of a negative phase followed by a 
positive phase. In other words, this requires that in the segment, there is no pop 
production applied anywhere below a push production. For example, the derivation 
tree in Figured has only two-phased segments. If every segment in every derivation 
tree of a grammar G is two-phased, we say that G is two-phased. Moreover, we 
call an indexed grammar quasi-left-linear if every output production is of the form 
A —> vB, i.e. terminal words are only output on the left. 

Lemma 21. For each breadth-bounded grammar G, one can construct a Parikh- 
equivalent breadth-bounded grammar G' that is two-phased and quasi-left-linear. 
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Proof. Let G = ( N , T, I, P, S) be breadth-bounded. First of all, by replacing each 
production A uBv , A, B £ N , u,v £ T*, with the production A —> uvB , we 
obtain a Parikh-equivalent quasi-left-linear grammar. Hence, we may assume that 
G is quasi-left-linear. We will use the restricted derivation relation =^G,iin, which 
requires that the applied productions are part of a segment. This means, we have 
x =>g, lin !J if V is obtained from x by applying a push, pop, or output production. 

We exploit the fact that derivations as above are essentially runs in a pushdown 
automaton: The nonterminal can be regarded as a state, its index as a stack content, 
and the generated terminal words correspond to the input of the automaton. In 
particular, for each A,B £ N, the language 

La,b = {u £ T* | A =>G ij n uB} 

is context-free. Hence, we can construct a finite automaton Ca.b whose language 
is Parikh-equivalent to La.b- Using the automata Ca,b , we will construct the new 
grammar G'. 

The grammar G' is obtained as follows. We assume that the state sets of all 
the automata Ca.b are pairwise disjoint and add each of their states as a new 
nonterminal. For each edge (p, a , q), we add the production p —» aq. Moreover, we 
add the production A —» <70 for the initial state qo of Ca.b and a production / — > B 
for each final state / of Ca.b■ Of course, since G is breadth-bounded, G' is as well. 

We clearly have ’P(L(G')) = ’P(L(G)) and we shall prove that for each w £ L(G'), 
there is a w' £ L(G') that satisfies \l/(u/) = ’F('iu) and can be derived using only two- 
pliased segments. The latter property will be refered to as two-phase completeness. 
We call two segments equivalent if they have the same initial and final nonterminal, 
the same effect on the index, and generate terminal words with the same Pariklr 
image. Suppose w £ L(G'). We choose for w a derivation tree t for G' such 
that in each segment, the number of push or pop productions is minimal among 
all equivalent segments. In other words, no segment in t can be replaced by an 
equivalent one so that the number of push or pop productions in this segment 
strictly decreases. We claim that then t has only two-phased segments. 

Suppose t had a segment that is not two-phased. This means, it contains a push 
production and, somewhere below, a pop production. Then, somewhere in between, 
there is a push production, followed by some output productions and a matching 
pop production. Here, ‘matching’ means that the pop production removes the 
index symbol added by the push production. Let A be the nonterminal to which 
the push production is applied and let B be the nonterminal that results from the 
pop production. Since push and pop productions involve only nonterminals that 
are already present in G, this means A,B £ N. We can therefore replace the chain 
between the H-node and the H-node by productions simulating a run of Ca.b- This 
strictly reduces the number of push or pop productions and thus contradicts the 
choice of t. Thus, every w £ L(G') can be derived using derivation trees where all 
segments are two-phased. 

We can now easily turn G' into a breadth-bounded grammar G" that does not 
allow segments that are not two-phased. This can be achieved by endowing the 
nonterminals of G' with an extra bit that indicates whether the current segment 
already contains a push production. If the latter is the case, no pop production is 
allowed for the rest of the segment. Then, because of the two-phase completeness, 
G" is equivalent to G' and hence Parikh-equivalent to G. Clearly, G" inherits 
breadth-boundedness from G' and is two-phased. □ 



24 


GEORG ZETZSCHE 


We can now prove Proposition!^ by showing that every quasi-left-linear breadth- 
bounded grammar can be turned into an equivalent grammar from a class for which 
effective semilinearity is well-known. This type of grammar is called ‘finite index 
scattered context grammar’. 

A scattered context grammar is a tuple G = (N,T, P, S), in which N and T are 
disjoint alphabets of nonterminal and terminal symbols, respectively, S G N is the 
start symbol, and P is a finite set of sequences 

(A\ —twi,..., A n —> w n ) 

of context-free productions, i.e. A,; G N and Wi G (N U T)* for 1 < i < n. 
We apply a sequence by applying all its productions in parallel to the current 
sentential form. Formally, we have x =>g y if there is a production sequence 
{A\ —> wi,..., A n —> w n ) G P and a permutation n of {1,..., n} such that 

X X()A 7 T (i'yX'l * * • Aff^n^^n, y XqW^^i^Xi * * * 

for some Xq, ..., x n G (N U T)*. The language generated by G is then 

L(G) ={w€T*\S =>* G w}. 

The grammar G is said to have finite index if there is a number BgN such that 
each w G L(G) has a derivation S =>g wi =>g ■ • • =>g w n = w such that |w)i|jv < B 
for each 1 < i < n. It is well-known (and not hard to see) that languages generated 
by finite index scattered context grammars are effectively semilinear |9J. 

Lemma 22. Given a two-phased quasi-left-linear breadth-bounded grammar, one 
can construct an equivalent finite index scattered context grammar. 

Our proof of LemmaEi requires the decomposition of derivation trees into slices. 

Slices. Suppose t is a tree with edge set E. An arrow collection for t is a finite 
set A together with two maps vq, : A —> E. If uq (a) = e and iq(/) for edges e, / 
of t, we call e and / the source and target of a and a is an arrow from e to f. If 
t is a derivation tree of a breadth-bounded grammar G such that all segments of t 
are two-phased, we endow it with an arrow collection: From each push edge e, we 
draw an arrow to each of the pop edges that remove the index symbol created by 
e. If e is a push edge, its type is the set of phases at which arrows from e arrive. 

Since G is breadth-bounded, we have an upper bound on the number of segments 
in derivation trees: If k is a bound on the number of nonterminals in sentential 
forms, then a derivation can contain at most k — 1 split productions; and if there 
are at most k— 1 split productions, a derivation tree can contain at most 2{k— 1) +1 
segments. Since G is two-phased, this entails a bound on the number of phases in 
derivation trees. In particular, the number of types of push edges is bounded as 
well. A positive phase in which all push edges have equal type is called a (positive) 
slice. Observe that if in some positive phase, the push edges e and / have the same 
type, then every push edge between e and / must also have this type. This means 
each positive phase decomposes into a bounded number of positive slices. 

Consider a derivation tree t for G and a decomposition of each segment into < 2 
phases. In the same way, we assume a decomposition of each positive phase in t 
into positive slices. Note that each pop edge is connected by an arrow to a unique 
push edge. Therefore, the type of a pop edge is the positive slice containing this 
push edge. A negative phase in which all pop edges have equal type is called a 
(negative) slice. As above, we can argue that each negative phase decomposes into 
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a b 


Figure 3. Derivation tree with arrow collection. Solid edges rep¬ 
resent chain edges. Each chain edge has a label x.y.z , where x , y, 
z indicate the segment, phase, and slice to which the edge belongs. 
S', A, B, C are nonterminal symbols, /, g , h are index symbols, and 
a, 6, c, d are terminal symbols. 


a bounded number of negative slices. For an example of a derivation tree and its 
decomposition into segments, phases, and slices, see Figure 0. 

By a simple modification to G , we may assume that there is a chain edge (i) di¬ 
rectly below the root node and (ii) directly below each node created by a split 
production. In other words, at the beginning of the derivation as well as directly 
below each node created by a split production, a segment begins. 
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Figure 4. Slice tree arising from the derivation tree in Figure 0. 

Each edge label indicates the slice that induces the edge. 

Slice trees. The decomposition of segments into a bounded number of slices gives 
rise to the concept of slice trees. A slice tree is a tree together with an arrow 
collection A , such that the following holds: 

(i) For each arrow a, vq (a) is an ancestor of (a) (in other words, there is a 
path from the root to a leaf that contains (a) and i'i(a) such that i/q (a) 
is closer to the root). 

(ii) Every edge e is either (1) a positive edge , meaning that there is at least one 
arrow leaving e and no arrow arriving in e or (2) a negative edge, meaning 
that there is precisely one arrow arriving in e and no arrow leaving in e, 
or (3) there is no arrow leaving or arriving in e. 

(iii) If e is a positive edge, then for every path from e to a leaf, there is an arrow 
from e arriving on this path. 

(iv) On each path from the root to a leaf, the arrows are well-nested, meaning 
there is no subsequence of edges e, f,g,h and an arrow from e to g and an 
arrow from / to h. 

To each derivation tree t of G, we associate a slice trees t as follows. We choose a 
decomposition of i’s segments into phases and then a decomposition of phases into 
slices. We delete all nodes with label in T U {e} (in other words, all leaves) and we 
merge each slice (positive or negative) down to a single edge. Now, the only edges 
that do not result from merging a slice are those created by split productions. 
Because of our modification, there is a slice edge directly below, so that we can 
merge them with this slice edge below. The arrows in t arise from the arrows in t: 
If there are arrows from one slice to another in t, then we add an arrow between 
the corresponding edges in t. This completes the description of the slice tree i. As 
an example, the derivation tree in Figure 0 results in the slice tree in Figured- 

Note that there is a one-to-one correspondence between the edges of the slice tree 
i and the slices of t. Moreover, the branching nodes of t correspond to applications 
of split productions in t; and degree-one nodes in t correspond to nodes that are 
incident to two slices. Since the edges in t are in correspondence with the slices of 
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t, we also call them slices. Furthermore, since we have seen above that we have a 
bound on the number of slices in a derivation tree for G, we have an upper bound 
for the size of all slice trees of derivation trees for G. 

We are now ready to prove Lemma [22. 


Proof of Lemma 0. We may assume that every terminal production in G is of the 
form A —> e. Fix a slice tree t. We will construct a finite index scattered context 
grammar that generates all words in L(G) that have a derivation tree with slice tree 
t. Since we have a bound on the size of slice trees for derivation trees of G and the 
languages generated by finite index scattered context grammars are closed under 
finite unions, this clearly suffices. 

Consider a derivation with slice tree t. Then each slice s starts in a certain non¬ 
terminal X s and ends in a nonterminal Y s . These satisfy the following conditions: 

(51) If a slice S 2 is a direct descendant of si, then X S2 = Y Sl . 

(52) If a; is a node with two children, s is the slice above x and si,S 2 are the 
slices below x, then there is a production Y s —» X S1 X S2 in G. 

(53) If s is the slice below t’s root (note that i’s root has degree 1), then X s is 
the start symbol S of G. 

(54) If s is a slice whose lower node is a leaf of t, then there is a production 
Y s — > e in G. 


Since there are only finitely many ways to choose th e non te rmina ls X s and Y s for 
each slice s, we may assume a fixed choice such that IfSIll to lfS4ll are satisfied and 
construct a grammar that simulates all derivations in which this choice occurs. 

The nonterminals of our grammar G' are pairs (s, A) 1 where s is a slice and A is 
a nonterminal of G. The idea behind the construction of G' is that each sentential 
form contains one pair (s,H) for each slice s. This nonterminal creates the same 
output as the slice s in the derivation of G. In order to simulate the index words, 
we have production sequences that simulate each push production in parallel to all 
matching pop productions. This is possible since all the index symbols pushed in 
some positive slice s are popped in those negative slices Si,..., s n for which there 
are arrows from s to Si,..., s n . 

Hence, positive slices are simulated in the same order as their productions are 
applied in G and negative slices are simulated in reverse. Therefore, we define 
Z s = X s if s is a positive slice and Z s = Ys if s is a negative slice. 

The grammar G' begins each derivation by producing a string f t that is defined 
inductively by describing f u for every subtree u of t. Let r be the root node of u. 
If r has no children, then f u = s. If r has one child, then we denote the subtree 
under r’s child node by u' and define f u = (s, Z s )f u t, where s is the slice incident 
to r. If r has two children c\ and C 2 , then we denote the trees under c\ and c-i by 
Mi and U 2 and define 

fu = (si, Z Sl )f Ul (s2, Z S2 )f U2 . 


In other words, we perform a pre-order traversal and when we walk along a slice s, 
we append (s, Z s ) on the right. 

Let us describe the production sequences in G'. First of all, we have a sequence 
that produces the word / t , namely (S —>• ft). In order to simulate the creation of 
a stack symbol in a positive slice s that is consumed in s’s corresponding negative 
slices Si,... ,s n , we have the sequence 


((s, A) —> (s,B),(s 1 ,B 1 ) —> (si, Hi),... (s„, B n ) —> (s n , A n )) 
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for each / £ I such that there are productions A —> Bf and Aif —>• Bi in G 
for A,B £ N, and A^Bi £ N for 1 < i < n. In order to simulate the output 
production A —> vB , we have a sequence ((s, A) —> v(s, B)) for each positive slice 
s and a sequence ((s,B) —> (s, A)v) for each negative slice s. 

Finally, we need sequences that remove the nonterminals if they match the tar¬ 
get (initial) nonterminal chosen for the respective positive slice (negative slice). 
Hence, we add the sequence ((s, Y s ) — y e) for each positive slice s and the sequence 
((s,X s ) —>• e) for each negative slice s. It is clear from the construction that then 
L(G') = L(G). □ 

Acknowledgements. The author would like to thank Sylvain Schmitz, who pointed 
out to him that Jullien [23j was the first to characterize downward closed languages 
by simple regular expressions. 
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